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1., preface 

1*1 Ob.iectives 

The aim of Ihis project is to devise a system for generalising 
lamotely sensed data for areas of the earth's surface at a scale 
apia^opriate to a world data collection system* Successful generalisation 
\d.ll allow the identification of changes in surface cover over 
successive orbits, and may be used for small scale mapping of -these 
changes . 

This system, if satisfactory, could be used as a first stage 
data 1 liter, selecting for further analysis those images which show 
significant changes from earlier imagery. It is desirable that -the 
sys-tem uses data which has undergone the minimum amount of pre- 
processing, and hence bulk (system corrected) ERTS imagery has been 
used throu^out. 

Since it is likely that any operational earth observation satellite 
will, like ERTS-1, return the ii^gery as a digitized picture, there 
are many advantages in devising a system capable of pj^ocessing this 
raw digi-fcal data. Among these advantages aie the reduction in slow 
and costly photographic processing, the potential for developing 
a fully automatic system, and -fche ability to make use of the ftdl 
dynamic range cf the sensors vri-thout the degradation in radiome-feric 
fidelity introduced by photographic processing. Also fu-ture systems 
will possibly have sensors vkilGh do not yield information sid table for 
photographic presentation. 
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1,2 Gengralisation of the iBaaggry 

7 

Each ERTS fraae coatadjis ? z 10 pixels (picture elements) for 

6 

the four MSS haiids, vjhereas a regional map at a scale of, say, 1 ; 10 
2 3 

could show 10 - 10 independent data points in an equivalent area. 

It is thus obvious that some generalisation of the ERTS imagery is 
required for regional scale invest! ^tions. 

This generalisation can be introduced xn two ways. The conventional 
approach is .to classify the imagery into surface "types, and to use 
these surface types as a basis for generalisation. One disadvantage 
of this approach is the expense in computing power required to classify 
each pixel quite apart from the difficulty of recognizing 'standard 
types’ for such classification. An alternati"ve system is to 
generalise the raw data prior to classification and mapping on a regional 
scale. This second approach is adopted for this project, since it offcars 
the capability of efficiently monitoring the imagery on a regional scale.. 

The types of fea"fcure idiich should be significant on this scale 
are, for example, large grass bums in savanna regions, or snow cover 
in the catchment area of a reservoir. It is hoped that the system 
described here could iden-tify this type of change against a background 
noise of smaller scale changes, ” 

1,5 Generalisation Ui^ity 

One of "the aims of tiae project is to deteimiae the area o"vex 
which the data can be effectively generalised and yet still gi"sre a useful 

reiiresenta"ticn of the .image. It is probable that this area will be 

\ 

a function Of the terrain type and the features of interest. 



Initially » square Unit Test Areas (UTAs) covering 2»5 x 10^ 
hectares have been used. These UTAs are defined by the geographical 
coordinates of thedr centre points, and have edges parallel to 
lines of latitude and longitude. In some cases it mi^t be desirable 
to use a UTA coinciding with geografhical boundaries, and to allow 
for this UTAs with polygonal outlines may also be used. 

1.4 Method of Generalisation 

The generalisation method which has been adopted characterizes 
each UTA by its gray scale frequency distribution. The parameters 
used for generalisation could be quantities derived from those 
distributions, such as the mean and the standard deviation, or they 
could be the histogram that represents that distribution. The latter 
will be used in this investigation, because the gray scale histograms 
are the simplest way of handling the information contained in the 
frequency distribution. For most purposes the four MSS bands will 
be laeated independently, but greater sensitivity could be achieved by 
using a frequency distribution in four dimensional measurement space. 

More complex methods of generalisation such as the power spectrum 
of the structure in the image coxxld give parameters related to the 
photogrametric tise of texture. The use of. such alternative means 
of generalisation will be investigated at a later stage of the project. 

1.5 Data Base For this Investigation: 

Two test sites have been chosen, although adequate coverage is 
only available for one of these, which is aai area in the Central 
?alley of California, and includes part of ths Sierra Nevada and 
the Coastal Range, The second test area is in. Eastern England. 



The 70 nnt bulls nogatlvea for each of the scenes including 
part of the test areas have been ezamined, and CCTa obtained for 
4 of the scenes. The centre points, scene identifie's and local 
tape code names for these tapes are listed in table 1. 

1 



NASA 

Centre 



Local Code 

Scene Identifier 

Point 


Late Taken 

TAPE B 

. 1053 - 18114 

5T® 27 'N, 120° 

22 *W 

50 Aug. 75 

TAPE C 

1051 - 10554 

55® N, 0°N 


25 Aug.72 

tape D 

1056 - 18114 

57® 25*N, 120® 

22'W 

17 Sept. 72 

TAPS E 

1508 - 18122 

57® 54*N, 120® 

57 ‘W . 

27 May ,75 


1.6 Sgone :Of this Report 

This is a detailed progress report covering all aspects of the 

investigation during the period hecesber 1972 (when the first CCTs 

were obtained) until February 1974. Significant results are presented 

concerning the accuracy with »diich the UTAs can be located in CCT 

coordinates, and the extent to which small scale structures in the 

* 

gray scale histograms are related to real features on the surface. 

In addition, progress in the development of analytical techniques is 
presented. 



I TTA Location . and Compilation 
2.1 Introduction 

Of primary importance for automatic generalisation of satellite 
liaagery, is the ability (i) to align the digital data to geographical 
coordinates, and (ii) to select those pixels vrhich lie ^d.thin a 
specified target area. 

There are several ways in which this alignment could be made, 
but for an automatic system operating on a global scale, the most 
hopeful in terms of computing time is to use the information available 
on ihe satellite position and attitude. This is the method used by 
NASA for meteorological satellites and for ERTS bulk (system corrected) 
imagery. IVom predicted or measured orbital parameters the ERTS 
processing facility calculates the latitude and longitude of both 
the sub-satellite point and the central point of the image* The 
latter corresponds to the intersection of the principal axis of the 
RW cameras \/ith the earth's surface and depends on the attitude of 
the satellite. These two points are then vised to superimpose a 
latitude-longitude grid on the image. The pre-launch estimates of the 
accuracy of this grid were about 500 m, idiich implies that a 50km square 
UTA could be located idth a ifo error. This is sufficiently accurate 
for the sort of changes that could be detected by this method. 

2.2 Techniques for comrllirg the IFTAs . 

UTAs are defined by the geographical coordinates of their comer 
points, and are compiled from -the CCTs in two stages: 

i) The conversion from latitude - longitude coordinates to scan 
line, pictvire element coordinates (taps coordinates) is performed by 
linear interpolation betvreen the taps coordinates corresponding to the 



intersection of the latitude-longitude grid Kith the edge -of the frame 
using the grid information oontained in ihe annotation record at the 
8i;arx; of the tape. 

, . li) The digital data corresponding to a UTA is compiled by 
selecting from the CCTs those points idiidi fall between lines connecting 
the corner points of the TJTA. The UTA data is then stored as a 
separate data file consisting of segments of scan line of varying length. 

2.3 Results 

The accxiracy of the coordinate conversions performed by this method hav 
been tested using easily identified control points for the Californian 
images (Tapes E, E, and e). The geographical coordinates >rere taken 
off the UaCiS 1 s- 25O, 000 maps, and converted to tape coordinates . A 
section of the image surrounding the tape coordinate was displayed 
using line printer gray-scale plots, which gave an indication of the 
positional errors. 

Figure 1 shows the results obtained for the dam of Honey Lake 
in California. The positions found for the dam on tapes B5 and D3 are 
marked by open circles, and show errors in the range of 5 to 10 km. 

Tests for other parts of these images show that the error has a 
large constant component, and a smaller component which varies across 
the image and has a magnitude of about 1 Ion. ^ 

2.4 Discussion 

The source of the major error is almost entirely in the latit\ide- 
longitude grid data provided by NASA, and similar errors are found 
from the grid superimposed on the photographic products. It is ■ 



possible that the minor error is due to the coordinate conversion 
algorithm. This is particularly likely near the corners of the 
images sdiero grid marks are sparse, and linear Interpolation is 
less reliable because of geometric distortion. 

•• These errors in the positioning of the digital data relative 
to the OTAs are too large to allow reliable monitoring for changes 
in the imagery. 

Two solutions are possible for the purposes of this investigation. 
Either Ihe UTAs can be defined purely in terms of tape coordinates 
and subsequent images aligned by use of a correction which initially 
would have to be determined manually, or the geographical 
coordinates could be converted to tape coordinates with due allowance 

made for the error in the grid. The advantage of the first is thv; it 

) 

removes the need for unpacking the annotation record, and so reduces 
the core store required for programs, which would bring useful savings 
in computer xisage. The second alternative allows geographically 
significant (or at least interesting) areas to be used more easily, 
but at the expense of increased computer iisage. 

An interesting ezcercise in tliis context is the vise of a 
cross-correlation technique to align one image with another. This is 
a standard technique in crystallographic and in biological image 
processing where the image consists of well defined objects seen against 
a uniform background, but it is not immediately obvious that it will be 
successful in alibiing remotely sensed images where there is no clear 
distinction betv/een object and background. Density slicing is one 



means of reducing images to a collection of objects on a unifona 
backgrovind, and this offers the most promising line of action. This 
approach is currently being pvirsued. 

Alternatively, the tape coordinates of features recognisable on 
the photographs could be taken directly from a coordinate gria 
superimposed on the photograph, provided that the tape and jiiotographic 
imagery are coincident. However, it appears that there is an 
along- track displacement of tape imagery relative to the photographic 
imagery by as. much as 8km, and so there is no advantage in using thdLs 
method. The relative displacement of the two types of imagery has 
been found for all cases fear ^ich we have CCTs. 

2.5 Development of the Comnuter System 

The following operational programs for handling CCTs eiid compiling 
UTAs have been developed. 

1. IDREAD Reads and \mpacks an annotation record in the 5KTS 
CCT Format, 

2. ANNRD Reads and unpacks an annotation record, including 
the latitude-longitude grid data. 

3. heads Produces a line printer listing of the relevant 
information in identification and annotation records. 

4o BULKll J- Constructs the mask for compiling sqxmire UTA's xd.th 
sides parallel to latitude and longitude grid, 

5, BDLK12 s- Compiles a UTA using the masks generated by either 
BULm or BUUQ5, 

B0LK15 ,J- Constructs the mask for compiling polygonal UTA’s 
with a maximum of eight sides. 


6 . 

i 
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In addition, routines EULIQ5 and BULK14 have been witten for use 
as self contained prograias for compiling histograms (smoothed and 
unsmoothedj, and for contouring on the graph plotter rectangular 
areas of the images defined by tape coordinate boundaries. 

Apart from these larger segments, several service subprograms 
are available, including routines for unpacking and packing the digital data 
into words as used on the original! CCT's, routines for converting the 
CCT'S to ICL readable format and for producing grey scale picttres on 
the line printer. 

2*6 Disnlav Techniques 

In crdor to locate recognisable ground features in terms of 
tape coordinates, some foimi of display of ihe data on the tapes is 
needed. Two forms of picture display are available ^diich are to some 
extent complementary, and both vdll be maintained. 

i) Line printer grey scale pictures simulate the photographic 
Imagery, and tiiereby provide a rapid display of the data. However, they 
suffer ftom the disadvantages of mapping at too small a scale, and of 
a limited range of gray tones. For speed, no allowance is made for the 
skew on the imagery, or for the distortions imposed by the use of a 
standard line printer, althou^ in principle this could be done. 

Therefore, the resvilting map is not geometrically accurate, and is not - 

comparable , in the overlay sense, with conventional maps. 

ii; Contotu? maps of the gray scales drara by the graph plotter have 
corrections for the skew and sampling effects, and have an easily variable 
scale. In addition there is a wider rang© of gray scale information 



available « 


The dieadvantages are the time taken to produce a single display, 
the diffictilty in ouu touring areas of rapidly changing grey tone and the 
lack of an inmediate visiml interpretation Of tJ® display. 

Figure 1, provides an example of a gray scale printout and 
Figure 2 shows a sample contoiir map. 
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3* Gray Scale Samplina^ 

3.1 Introduction > 

Radiometric errors in the OCX imagery impose limitations on the 
use of the gray scale histograms. It is necessary to investigate 
the magnitude of those errors, so that the significance of particular 
differences hetwetm histograms can he assessed. 

The soirees of radiometric error to be investigated are:- 
i) those related to the conversion from radiance to gray scale 
levels on the CCTs 

ii) those caused by the uneven response of the sensors. 

3.2 The MSS Transfer Function 

A detailed description of the MSS transfer function, which, determines 
the conversion f^oa radiance to gray-scale values is given in the section ' 
"System Performance" in the ERTS Data Users* Handbook, A brief 
summary will be given here. 

The radiance, R, to tape count (gray scale values on the CCT), T, 
transfer function is always linear, but as will be seen, the a priori 
probability of any pixel tiaving a tape count T is a fimction of T itself: 
that is, the probability distribution (as opposed to the frequency distribution) ^ 
is a non-unifonn function of T, The cause of thj.a non-uniformity is the 
discrete nature of the gray scales, and becomes more apparent when the / 
intermediate transfer functions, f]L hetvreen R and the sensor count, S, and 
f 2 between S and T, are considered. 

For band 7 alone , the transfer function f^ is linear and S has a 
range of 0 to 63, The second transfer fimctlon which is related 
to th® processing of the raif sensor data by the Special Processing 
Subsystem (SPS) , is also linear and T has the same range as S, iimong 



the operations perf orssiod by SPS is radiometric calibration of the 
sensor data and this could cause to differ from a one-to-one 
mapping of S onto T. If for eiample two different sensor coimts» 
and S2, are always mapped into a sin^e value, then the 

6 urjori probability of obtainin.g will be txd.ce the mean 
probability. 

The non-uniformity introduced by this effect will be spread 
over adjacent gray levels only and could be removed by smoothing 
with a simple "tunning mean technique* In practice the calibration 
function changes continxially, and this smoothing does not appear to 
be necessary* This is shown by Pig* 3 vdiich is "the unsaoothed band 

7 histogram for a 50 km square area' in ihe central valley in California* 

The sensors for bands 4, 5 and 6 are usually operated in ihe 
compressed data acquisition mode, in vdiidi the sensor transfer function 
f^ is non-linear, with the sensor count scale compressed for high 
radiance values* The range of S is again 0^S^3* To correct for the 
non-linearity of f^, the SPS transfer function is also non-linear, 
and can be approximated by a quadratic of the form 

T = 0.025 + 0.45 S (l) 

This gives a tape count scale which is linear idth a range O.^T^127, 
or ttdce tlie range of S. Because of this difference in the ranges of S 
and T, the transfer function f^ cannot be a one-to-one mapping; V/ithout 



the smearing introduced by the changing calibration function which 
modifies equation (l), the probability of obtaiiring at least half 
of the T values would be zero* Becaixse of 13ie non-lineaiity low 
T values ^dll correspond to 2 or more S values, whereas at the high 
end of the radiance scale only one in every two or three T values 
wotild have a non zero probability. This is illustrated by Figure 4 
v/hich shows the mapping of the 64 S values into T, on the assumption 
that f 2 is represented exactly by (l). This distribution shows that 
the ^ priori probability of obtaining T = 0 is approximately six times 
greater, on average, than that of obtaining any particular value of 
T greater than 120. 

The Effect of the non-xmifor:^ probability on the histogram . 

The grey scale histograms derived from the imagery are the product 
of this non~uniform probability distribution and the actual grey scale 
frequency distribution. The effect on the histograms is illustrated by 
Figure 5o Here each line shows the histogram for 12 scan lines of the 
band 5 for Tape D3. The tape counts have been combined in pairs to 
give a range of 64 values. One of the most prominent features of the 
histograms are the col\>mns with anoaolously low frequencies which persist 
over several sets of 12 scan lines, and which are associated with ^ey 
levels for which the ^ •priori probability is low or zero . 

Because -the exact form of -{he napping from S to T changes from 
scan line to scan line, and from date to date, the lower limit of the 
detail in the his'tograms which can be accepted as real is approximately 



the range of T \Aiidi corresponds to a single value of S 


Because 


of Iho non-linearity of fg» this is a functicai of T, i.e* smaller 
changes can he detected in iJie radiance of dark features than can he 
detected in brighter features. 

The irregularity of the histograms can he reduced, and at the 
same time the redvaidancy in the grey scale values due to the 
.ipanslon of fee rango can bo romoved by smoothing tho 

histograms vn.th a suitable convolution function. The aim of this 
convolution is to derive a 64 level scale for vdiich each level has 
an equal a •priori probability. The probability of obtaining a 
value betifeen T and T +AT is related to the probability of obtaining 
a value between S and S +As by 

, ftT).CiT= I? 

¥ot a uniform distribution in S, this reduces to 

plT).fcT= -h.- (3) 

Differentiating (l) yields 

z: 0 : 0 S 0 • 

which, on substituting for S gives 

• I ' 0’^5'yTH'40T (4) 

The width of tiie 64 intervals in T which have equal a priori 
probabilities is found from ( 5 ) and (4) 

, = o-or/si-i-jtCT ' 

The centre points of these intervals is given by 

T. i\T. 


T 

T? 


( 6 ) 



Preliminary tests have been made xjsing a trapezoidal 
convolution function which has a width at half height given by (5)* 
outer edges idth sloi^es of +1 and - 1, and an eqxial \^eighting for 
all points near the centre of the interval (Pig.6) 

This convolution function has ihe advantages that it is smoothly 
varying over the range of T, and allows for the one-to-two or one- 
to-three mapping at high T values by wei^ting eqvially all values 
near the centre of the window* Figure 7 shows a histogram both 
unsmoothed and smoothed using this convolution function. The full 128 
T values have been used in plotting the histogram, although at most 
64 of them are independent , and a 64 point non-linear scale teiuld 
be more approprate, 

3.4 Identification of Scanner Irregularities 

The multi-spectral scanner that produces the radiometric data has 
6 independent sensors in each band, and thus any unevenness in the 
response of those sensors will produce a banding effect on the image 
which would repeat every 6 lines. In an effort to identify the 
magnitude of this effect, data from the CCTs were subjected to the 
technique of power spectrum analysis* 

This technique of analysis takes a given sequence of data and 
reduces it to a series of wave forma. Each imve, which is of the fc<rm 
of a sine wave, is characterised by two parameters, a wavelength (the 
distance between tt?o successive peaks), and an amplitude (the height 
of -aio ^^av 0 form). The power associated with each wavelength is a 
measure of the contribution of that wavelength to the total signal. 



Thus any wavelength corresponding to a pronounced handing effect will 
plot as a peak on the graph of power against vravelength, which will 
thus identify strong periodicities present in the data. 

The analysis was implemented by use of the program published 
by Davis (l973). llie data input vreis a string of gray-scale values 
at right angles to the scan lines, parallel to the flight path of the 
'satellite, with a sampling interval of one scan 3ine, Thus any 
periodicities with a wavelength of 6 times the sampling interval 
could be associated with unevenness in the sensors. 

The resulting power spectra for all four spectral bands were 
calculated for several images over lengths of 300 scan lines. The 
results for area of the North Sea (Tape C4) and for an area of the 
Sierra Nevada of California (Tape D3) are shown as Kgures 8 and 9. 

Results, not shora hare, were obtained for adjacent scan positions, 
which demonstrated that the spectra were identical or very similar 
if a small liorizontal shift was applied. The resultant plots have 
been smoothed by application of the Hanning triangular window (Davia 
1973)* and poorer is plotted on a logaritlwric scale,- 

The results for the \fash image (Piguro 8) , sho-ws that there is a 
distinct paik near the wa-velength of 6 scan lines, identifying the 
unevenness of the sensors. On this image all the values are very low, 
reflecting the fact that the image consists of open sea which is 
uniformly dark. Becai^se of the low input values the res\iltant 
spectra are very noisy, Hovrever, the peak at wavelength 6 is markedly larger 
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than the other peata, which are associated with noise. The hei^t 
of this peak decreases as the mean radiance increases, implying that 
the magnitude of the mevennees is similar on all bands, but that its 
relative importance varies inversely with radiance. The graph 
suggests that the variability attributable to this effect ds of the 
order of to 1 gray scale value. 

The results for the Sierra Hevada (ligure 9) show the power spectra 
in liie presence of a much stronger signal. The parallelism of the spectra 
for the four bands shoifs that, for this image, most of the power is 
related to. sitrface features. Once again a peak is visible at a 
wavelength of 6 , and suggests an irregularity of about 1 gray scale 
value, that can be attributed to the scanner. 

It has been shown that both the non-uniformity of the radiance 
to gray scale transfer function, and the use of 6 independent sensors 
for each spectral band, cause detectable effects in the iiaageiy. 

The irregularity of the histograms is accotmted for by the non- 
uniform distribution of .a .priori probabilities of gray scale values. 

This is most significant at hi^ radiance values, where it is equivalent 
to an imcertainty of about 3 in the gray scale. 

The banding introduced by the unequal responses of the 6 sensors 
Causes an uncertainty of about 1 in the gray scale. This is most 
significant at low radiance values where the errors introduced by 
the transfer function are small. 
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These effects impose a lower limit of 2 gray scale to1u©s in. 
the width of the classes -which can reliably be used in preparing and 
comparing gray scale distribution histograms^ An algorithm Ijas 
been suggested which divides -the full range of T into 64 classes with 
equal a priori probabili-ty , but the importance of sensor ttnevenness at 
low radiance levels implies that 32 classes -woxild be more suitable « 
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4. The .toalvsis of Gray-Scg4Q HiatograiHS 
4.1 latrMuotiosa 

Initial examinaticHi of histograms for seTreral areas showed 
that there was a possibility of equating components of the histogram 
with surface types. Thus the possibility of splitting histograms^ 
and thereby deriving estimates of the percentage of any image covered 
by each surface type was investigated. 

The investigation concentrated initially on an area ^diich shovred 
a simple biraodal histogram, vihich could be readily interpreted. 

This area xfes part of image C4, which covered part of eastern En^and 
and the adjacent sea area. The land in this area consists mainly of 
small fields of the same order of magnitude, or smaller, than the 
70 m resolution of the scanner, and thus the overall impression given 
by liie land is of a uniform gray speckle. It was thus not expected that 
this technique wovild be able to distinguish between different areas on 
the land. In contrast, the sea on the image shows up as a dark area, 
visible on all foxir bands, althou^^ the greatest contrast is offered 
by band 7, in xjhich the gray scale values for the sea are nearly all 
either 0, 1, or 2. Ho-wever, on bands 4 and 5 the considerable sufeas of 
mud-flat, turbid x/ater and sand bank are readily visible. It xras 
hoped that fear these bands it might be possible to split the histogram 
into three components, thereby identifying proportions not only of 
land and sea but also of turbid water, Hox^ever, the liistograms 
x^re in fact monomodal shoxdLng that the tw distributions that could 
be identified on bands 6 and 7 x/ere overlapping, and that it was not 
possible to accurately identify even tx-?o components of the histogram. 



A. 2 Technique adopted 

A large num'ber of techniques have been developed for splitting an 
observed frequency distribution into the sum of tuo or more component 
distributions. Clark (pers.comm.) has reviewed these methods » and 
recognised three main groups; graphical » analytical, and numerical. 

Gi'aphical techniques rely on the subjective interpretation 
of discontinuities in the cumulative frequency curve, and are thus 
subject to a high degree of operator error. Analytical techniques 
are exceptionally difficult to calculate, and do not on every 
occasion yield a solution. Niacerical techniques are much more 
varied, but nearly all use iterative metiiods to improve upon initial 
estimates of parameters describing the component populations. They 
are thus suitable for incliision in an automated data handling system. 

Of the several alternatives available, it was decided to adopt 
that of Jones and James (l972), which uses a maximum likelihood 
method of estimation. The main reason for this decision were as 
follows : 

a) Although initial estimates of the parameters to be estimated 
are reqviired, the method will converge to a stable solution even 

if these are considerably in error, and it wotild thus always be 
possible to provide a complete null set of initial estimates, 

b) The published work included a F©TRAH program, and so 

made it possible to examine tbs capabilities of this approach without 
the investment of a large pi'cgrammiag effort at this exploratory stage. 



c) It is possible to extend the existing program to include types 
of ccmponeat distribution oilier than the Kormal. The published •work 
includes both "file circular Normal dlS"tribu’fcion, and 'the mechanism 
for adding other distributions. One major disad-vantage is 
that at present the existing program will handle only tvio components. 

4.3. Brl.e.£_Outline of the Haximua Likelihood Method 

Tls maximum lilcelihood method relies on 'the fact that for any 
observa'tion set, X, and a s'tatistical model with parameter set 0, 
it is possible to calculate •Uie a nosteriori probability of X occurring 
as the outcome of the stochastic process defined by the model. 

Thus for any model it is possible to find, in principle, a parameter 
set & that \fill maximise the a -posteriori probability, or likelihood, 
L, of the observation set X. 

In the case of a set of parameters refering to a mixture of 

distributions, the model becomes: 

* til 

Let the j - underlying probability distribution be fj (xi), 

where j = 1, m, and the subscript i, i = 1 n refers to 

the observations. Thus for each component distribution, the 
probabili-ty of -the observation set is: 

Pi 

If the proportion of each component distribution is then the 
total a p oateriori probability of the observation, set is given by 

m n 

J 
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Usually, however, it is easier to iTausform these probabilities into 

their logarithms, and replace the multiplication by an addition, so that 

the transformed expression to be maximized becomes 

m 




'^3 '♦'i 




The parameter set 6 for the model consists of (m-l) values of q , 

J 

and Ihe parameters for each of tiie sub popxilatioas f ^ . Note that 
because the Qj's must sum to unity, (m - 1) values fully determine set. 
In the case to be considered here, the underlying distributions 
are all considered to be Normal, and thtts have the form 


where and <f are the mean and standard deviation. V/here there 
are two such distributions, only a single proportionality parameter 
is reqidred, which we will call CX« • 

Thus the total statistical model to be considered is 


r-’ 


» CxjjZTrV. 


Z C-,' JJ Z (T/ 


And the parameter set thus consists of the five parameters 

0 -- { , <^r] 

Because the derivatives are non-linear, it is not possible to reach, 
the maximum likelihood solution by direct evaluation, but it is necessary 
to use a numerical technique. The Jones & James FTogram uses the 
steepest ascent method, to provide a rapid convergence towards the maximum, 
and thereafter clianges to the Newton Raphson method which is superior 
near that max imuin, but also more sensitive to poor initial estimates. 



4 ♦ 4 Application to thei^oblem 


In order to be able to use the maximim likelihood method 
it is necessary to have an observatton set» rather ttem a grouped 
histogram. Eowovery the his1»grams of any one area are the result 
of the examination of a large number of pixels, and it ras thus 
considered convenient to convert the histogram to a reduced observation 
set, purely as a form of data reduction. Secause of the integer 
natire of the gray-scale values, it was possible to reconstruct an 
observation set with approximately the same proportion of values at 
each gray scale level as in the original data set. The progra® 
set up to accept a data set of up to 200 points, which was constructed 
f^om histograms typically derived from 80,000 points. Repeated analysis 
using a larger data set of 400 points yielded almost identical component 
distributions, confirming that this data reduction does not distort 
the results • 

The efficiency of the program tma improved by the use of integer, 
rather than real, arrays for storing Ihe data set. Although this 
reqiiired the use of the FLOAT operation for calculations of the 
probability function, it permits a more efficient storing of the data. 

A histogram drawing routine was provided, which used the line printer 
to give a plot of the input histogram, the histogram derived from the 
initial estimates, and Ihe histogram fitted by the program. 
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4o5 

Results were o'btained from the program for histograms derived 
from all 4 spectral hsuidsc The data from hand 7 yielded the clearest 
resviltSf since tiie bimodality of that histogram was extreme, vTith 
virtually no overlap between the two components. Initial estimates 
of the parameters >rere chosen visually, and a total of 30 iterations 
were required to produce stable estimates. A sample output is 
included as figmre 10, and a summary of the results for band 6 and 
7 is given below: 

Results of Likelihood Estimation of Parameters 

of Mixed Monaal Distributions 


Parameter 

Band 7 ! 

Band 6 1 

Initial 

Pinal 

Initial 

Pinal 

Proportion of 

n 

pop- 

1 

.7 

. 

0.717 

0.7 

0.769 

mean 

of 

tt 

1 

1.0 

1.255 

3.0 

2,856 

s.d. 

of 

ft 

1 

1.0 

0.712 

.4 

1.478 

mean 

Of 

M 

2 

23.0 

22.829 

20.0 

20.562 

s «d.« 

of 

tt 

2 

j 5.0 

1 

5.576 

3.0 

3.36 


As can be seen from the table, for band 7 population 1, which Vfas 
identified as the sea component, covers lloTfi of -ttie area. The corresponding 
estimate for band 6 ho\yever, was hi^er, at 

The results for bands 4 and 5 Ttere considerably less encouraging. 

The likelihood response surface proved to loe very flat, and th\is no 
stable solution seemed possible. In particular, the estimates of iho 
standard deviations seemed most unstable. 
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Furthermore the moat stable parameter* "fee proportion of tlie 
data in population 1 converged towards a value greater than onoo In 
a different context such a result, and the corresponding negative 
proportion for population 2 can be interpreted as representing 
a -single parent population from which a distinctive sub-population 
has been removedo However, in this case the concept of a negative 
(subtractive) population has no interpretable meaning, and the 
results are thus of little value. 

4.6 Discussion 

This pilot study has sho^m that the automated splitting of 
gray-scale histograms is possible, and can yield useful results. 

It has also shown that the resxilts yield discrepancies betvjeen bands which 
were s\xfficient to cast doubt upon the applicability of the method 
in general. In the li^t of these considerations, and the considerable 
computational tune required to perform the analysis, it was felt that 
this approach, while possibly iisefixl in specific instances, was not 
s^^i table for routine application to ERTS data. 

The alternative histogram splitting techniques tliat are available 
mi^it produce results more rapidly, and be more readily ap^plied. In 
particular methods that can handle more than two components need to be 
examined. However, at this exploratozy stage it is necessary to make 
considerable checks on the input hiatograms, to correlate these ifith 
ground truth, 

« 

At tlie time of this part of Uie study, the prooiem of correlating 
tape coordinates with geograpliical areas, had not been solved. Thus 
the identification of liie spectral responses of land types necessary 



for the interpretation of the histograms cotild not he pursued. This 
technique was not therefore tested further until more background . 
information became available. 

It is however pertinent to consider the question x^hettier other 
analyses might not yield similar results as cheaply. In particular 
if it were possible to map ground types, it would be a simple matter 
to estimate Iheir area. The possibility of lasing either cluster analysis 
to groiq) poijits (or collections of points) into hopefully recognisable 
groups, or some fom of discriminant function to map points onto 
^ 'Priori groups might be equally \jseful. Some of these techniques 
are discussed below. * ' 
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5. The Use of Clusterin g Tedmiauea 

f . ‘ 

5.1 Introductj.on 

In an attempt to produce a generalised map of the ERTS digital 
data the possibility of using cluster analysis was investigated. 

This form of analysis would group lifco areas, aiHi hopefully each 
cluster would be identified with a land type. Clustering using 
the spectral signature of Individual points has been used by 
many workers (eg, LARS, 1368), 

Because of the very large amovoits of information available 
from one ERTS CCT, it is useful to set up some intermediary units 
that represent an aggregated data points, which become the Operational 
Taxonomic Units (OTUs) which are the input to the clustering procediare. 

Once tl^ey have been designated, it becomes possible to collect data for these 
units from the digital tapes. These tmits are then input into a 
clustering algorithm which produces as an output a map of land types 
for a large area. There are thus three basic steps in the anals’^sis. 

a) Setting up the OTUs, 

b) Selecting the vaariables to be used, 

c) Producing the clusters, 

5.2 

The size of the OTUs considered has •the effect of setting a scale 
for "the resultant map, whereas tbeir number determines "the computational 
effort retjuired. 
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Setting aside, for a while, the computational problems, several effects 
can be noted idiich are related to the size of the OTUss 

1) A lower bound for the size of the OTTJ is set when each OTU, 
is a single pisel* At this scale, it is probable that the 'noise* 

r 

present in the picture trould be sufficient to mask any regional patterns 
that nd^t be detected. 

2) As the size of the OTU increases, so does the amoiait of information 
available about its characteristics » The ccmplete information is given 

by the full set of gray scale values, spatially ordered, and this information 
amount obviously Ti^comes very large as the OTU increases in size. This 
full data set contains much information that is not relevant at a 
regional scale, and it is thus necessary to derive a variable set tdiich 
effectively characterises that information. Obviously the lai’ger the 
OTU the easier it is to derive an efficient variable set, For example 
attempts to measiire texture require that each OTU contains a fairly 
large number of pixels, 

5) As the size of the OTUs increase, so the resolution of the 
final rmap diminishes. Thus boundaries between land lypes can be 
more accurately, but more expensively mapped if the OTUs oxe small, but 
on the other hand larger OTUs permit a classification on a much larger 

*i 

areal scale. 

4) As the size of the OTUs decrease, so their number increases and 
so liie final classification may alter. In particular, small OTUs may, 
by chance, contain anomalous pixels, which would result in some OTUs 
remaining outside nearly all the clusters mtil very late in the clustering 
procedure. Large OTUs tend to reduce the msalSir of these ’Kaverick* 



points, since they would "be included in, and hopefully *swanped out’ 
by larger niuabsi^s of normal pixels. 

With these consideratiotts in mind, the problem of classifying a 
significant reaction of an SHTS image vras considered. Initially the 
tar^t area to be clustered was set at of an image, that is tiie data 
contained on -i- of one CCT. This represents 810 x 500 pixels, tdiich is 
about 50km x 501 q 2, For the purposes of initial inveatigation OTUs 
were arbitrarily set a size 20 x 25 pixels. This size of OT0 has 
the advantages that:- 

a) Each OTU contains 500 pixels, which is sufficient to permit 
the evaluation of all parameters that mght be considered. In 
particular it is sufficient to give relatively sBuooth gray-scale 
histograms (see section 5, abov«»). 

b) The target image thus consisted of a matrix of 40 x 20 OTWs 
which was felt to be sufficient to produce a fairly detailed map. 
However, the 800 OTUs thus produced are more ihan it is feasible to 
use in a clustering algorithm which requires the entire interdistance 
matrix, since this would require the storing and repeated reaccessing 
of 640,000 distances. Alternatively by the use of algorithms that 
reqidre only portions of the interdistance matrix, it will be possible 
to cluster Ihese 800 points. However, the consideration of a smaller 

, portioii of •the image as a training set is also feasibl©. 

c) A subsidiary advaivfcage of these rectangular OTUs is that 
•they can be plotted on a computer line printer wl-thoiot geometric 
dis-tortion if so required. It must be remembered that thi.s size 



of OTU j.Sf Ixoweverj ossontially arbitrairy^ and was adoptcid puroly to 
fiiakd OTUs availabXo fccp an initial investigation* A full examination 
of the effects of adopting other sises of OTU must be undertaken. 
However, it is obvious that the choice of OTU size can be nade only 
in rea.ation to an _a ^r4or:|) objective. This choice, however, requires 
background kncnrledg© of the effect of changing OTU size. 

5*3 The Variable 

bith an OTU containing 500 pixels, each of v/hich contains 
a giay^scale value in 4 spectral bands, some sort of summarising 
procedure is necessary.. Several possibilities seem available. 

i) Summary statistics traditionally used to describe 
frequency distributions could be used. 

The measures that seem most suitable would be tiie mean, the 
standard deviation, and perhaps the sketmess and the kxirtosis. 

Taking the ^ean and standard deviation wuld reduce the 500 pixels 
to an 8 Variable set, and thus achieve a great compression and 
eceonomy. However because of this very great compression it >ra.s 
decided not to txae ttiis variable set in the first instance, but 
to use a more comprehensive variable set, and then to determine if 
such a reduced set woiad yield the same results. If it turned 
out lihat this reduced set would yield satisfactory results, its 



efficiency would certainly lead to its adoption. The xise of this 
z'educed set iri.ll "be ©rained in detail at a later stage. 

ii) the variable set adopted for initial consideration consisted 
of the information oontained in the gray-scale histograms. 

The gray scales for the 4 bands were all split into a series of 
classes, and the number of observations within each of these classes 
computed* The number of pixels within each of these gray scale classes 
was thus taken as the variable set for inpirt to the clustering algorithm 
A first stvidy split the gray sc.al^. to a 32 variable set, with 8 equal 
classes for each sj>octral band. 

For further investigations, the gray scale vrill be split into 
16 classes per band, thereby producing a 64 variable set. It 
is hoped tliat this very large number of variables will contadn 
nearly all the. inf ormation available at this scale, and will contain 
considerable redundancy. It is then planned to examine the 
redundancy present, in an attempt to produce a reduced variable set® 

This may be achieved by use of a stepwise, discriminant analysis, 
which will arrange the variables in their order of importance in 
determining the groupings. Other variable sets will also be compa3?ed 
vrith results from this full set. In Hiis way it is hoped to be able 
to use this extremely full variable set to act as a yardstick against 



which to measure the perfonaance of smaller variable sets* 

■ iii) Alternative variable sets can be devised which rely on 
more sophisticated transfoms of the data* 

For instance, the ratios of signals in varioixs bands have 
been found useful by some workers for specific instances, and 
this kind of variable might prove valuable in clustering for 
those special purposes* A second instance tliat is being considered 
is the use of a Foxirier transform to measure texture, and then 
to input the coefficients of the 2 dimensional Fourier transform 
in an attempt to produce clxxsters based on. textural properties* 

Other data transforms such as the Eadamard and Walsh transform 
(Harmuth, 1969) might also be used. 

A last possibility that might be examined is the use of a 
techriiq^ue such as principal components to produce a reduced variable 
set* This Moxild require the examination of the entire image to 
produce the components and then to cluster on a reduced set of 
component scores 

5*4 The Clustering Program 

The data for the OTTJs are then input into a clustering 
algorithm which produces groups of points in the vari9,bls space, 



which hopefully can he identified with land types. All available 
clustering algorithms use some measure of similarity (or dissimilarity) 
/between the OTUs in the variable space, and most require that the 
entire interdistance matrix be stored. The clustering algorithm 
then selects the two most similar points in the variable space 
(\diich is usually assumed to be Euclidean). These two points 
are then replaced by a single point, usually at the weighted 
mean of the tvro, and the distance to all remaining points 
recalculated. The processes is then repeated on the reduced 
matrix of interdistances, until only a single group remains. This 
algorithm is due, in essence, to 'I'iard (1964), and many programs 
have been published which offer varieties of this basic algorithm. 

For the purposes of initial experiiaentation the program published 
by Veldman (1967) -imis used, since this was readily available, 
and had already been used in a similar situation by (hfen Jones and 
Custance (l974), Hmfever it is suspected that more efficient 

n ■ 

algorithms are available, par tic\ilarly those that require that 
only one half of the symmetric similarity matrix be stored vdthin 
the macliinso The possibilities of using other programs will thus 
have to be exjjlored if this technique is finally adopted for use . 



5.5 Prelimlaary Results 

A preliffllnary study used a block of 5 x 5 OTOs, located on 
the eastern edge of image D3 in the Sierra Hevada of California, 

• • The block of observations ran from positions 1 to 100 
along the scan line, and used scan lines 126 to 250. The data 
^fere expressed in a 32 variable ^ace, so that each of tiie 
eqizal classes represented 16 gray seal© values in bands 4 to 6, 
and 8 gray scale values in band 7. 

These 25 points were clustered using the Veldman program* 
and a dendogram drawn from the results (fig.ll). The maps 
derived frem the last 4 steps are shown as fig. 12, and the 
loss of information du® to clustering shown as a function of the 
number of clusters in fig. 13. 

There seems to be some Justification for thinking there is 
a distinct Jump in the error loss function after ihe 21st iteration 
ie. v<dth 4 remaining groups. At this level the map of the 
resultant groups is beginning to show coherent spatial units, which 
retain theii* general pattern through the subsequent clusteriaigs. 

Figure 12 shov?s that the dusters form spatially compact 
units, and produce a thoroughly credible pattern. The roughlj^ 
diagonal arrangement is parallel to the prominent ridge and valley 
structure of the Sierra Kevada. 

Although these initial results seen reasonable, this test is 
very preljjninary, and must be supplemented by much more detailed 
examination of the, clusters produced from a .larger data set, and 
then checked against ground truth. 



It is however very encouragiiag that the clusters are spatially 
coherent, even though their spatial properties are not included 
in the variahlo set. This is in contrast with the results of 
■‘Owen-Jones and Ciistance (l974) who fotmd that without aPPlyi^^ 
contiguity constraint, the resultant clusters were not coherent, 

5,6 D iscu ssion 

Results so far show that clxjstering of aggregated units using the 
gray scale histogram is a feasible procedure, and that coherent 
groups can be produced without the imposition of the contiguity 
constraint. However, the quality, of the results must be tested against 
a larger data set. 

The computational aspects of the application, however, indicate 

■|. V. . ■ 

that the program id,!! only be useful for clustering small numbers of 

OTUs, By storing the entire interdistance matrix, the program 

2 

requires sufficient core to hold an H matrix to cluster I points, 
which must then be scanned .H times. However, it is possible to use 
clustering techniques that use considerably smaller amounts of 
core storage. For example the single-link method has been iTograiittaed , 
by Sibson (1973) in such a way as to require of the order of 4. H words 
of store, and a contiguity constrained algorithm has been presented 
'by Openshaw (1974) vjhich requires similarly sjnall amounts of store, 
■Both these algorithms are currently being investigated with a View to 
their implementation. 
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Of the severaD. teclmiqites ara5.1able for the computer proceeding 
of pictures, one of the most fundamental approaches is to split the 
picture into non'-overlapping zones by the detection of edges between 
approximately uniform areas (hosenfeld, 1969 ), There have thus been a 
considerable nmber of techniques developed for detecting* edges in 
pictures* 

It was decided to apply some of tiiese techniques to the 
BRTS Imagery, in an attempt to identify the kinds of feature that 
could he distinguished by tteir sharp edges* In particular it was 
felt that this technique would be useful for identifying areas 
on the tape where readily identifiable features, such as lakes, 
are lacking* TJie technique would thus be potentially very useful 
in the irrigated zone of the Central Valley of California, where 
large spatial units occur in a regular rectangiilar pattern. 

Commonly, edge detection procedures consist of two steps, the 
first an initial detection of potential edges, and the second a process 
designed to ’clean up’ the resultant image. Such a scheme will be 
adopted here, though at present only the first step has been implemented. 
The standard technique in finding edges is to locate the places There 
there is kn abrupt change in the pictTxre function (i*e. in Ihe gray 
scale values). This is performed by examining the local elope of tlie 
picture function, and where this slope exceeds a given tlireshold value, 
an ’edge’, is detected. Various more sophisticated filters have been 



proposed, (eg. ty Rosenfeld 1970 ), tjhicli examine the differences 
between average values either side of the potential edge, or which apply 
matched filters to each po.int in turn. However, for the purposes of 
this investigation it was felt initially that a fairly crude definition 
of an edge would suffice, and that any cleaning up of the picture 
that this would require could be carried out subsequently. 

The operational procedure adopted was to examine each point in turn, 
and wiiero the gradient at "this point exceeded a threshold value, to 
demark that point as belonging to an edge. Thus an initial program was 
Tjritten vrliich calculated the masimum slope at each point, using the 
function : 

, T)^ = L- 

l/here is the gray scale value at point This valr© 

is thus the maxim\mi difference between each point and its 
eight adjacent neighbovirs. 

6,3 Ke suits ^ 

A progi'am was written to calculate these values of » 
and produce a binary image of edge points. This has been applied 
to a portion of . the central valley of California (Tape D3). Pig 14 
shovis a gray scale print out of a portion of the analysed area, and 
Pig. 15 the edges detected in that area. 

As can be seen, this operation results in edges idiich closely 
correspond \d.th the field jattern as revealed in the gray scale printout 
but has tb.e unfortiuiate property that all edges are double, because the 
programme has identified each edge ttjice. This effect can be explained 
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by reference to the one dimensional case. If vre scan along a line, 
and find a large difference between points i and i + 1, point i vri.ll be 
labelled an edge, Hovrever this same difference will also be located 
when examining point i + 1, so that both points i and i + 1 are 
labelled as belonging to an edge. 

To circumvent this problem, differences v/ere calculated only for 
one quadrant of directions, so that only three adjacent points were 
examined, Thujs the edge identifying function becomes 



Tvro thresholds were used for the plotting of V/here was greater 

than 6, a dot was plotted, and v;here greater than 12 an asterisk v;as 
plotted. A result of this program, for the same area as for the other 
two pictures is given in fig, 16. Hote that a much cleaner image 
results, and that very few edges are actually lost, 

6.4 Discussion 

Obviously this work is still at a vsopy prelajninary stage, and much more, 
needs to be done, Eovrever, the results of these first attempts are 
sufficiently promising to indicate that this line of approach is valuable. 
The following tasks are seen as important, 

V ‘ ft 

a) the investigation of the use of different thresholds. 

b) the drawing of these maps using all fovxr spectral bands, both 
as four separate maps, and as one single, multispectral map. 

c) 'the development of a routine vjhicb. xfill ’clean up’ the resiultant 
binary isiage. This could be done by removing all points not 
surrounded by at least n adjacent ’black* elements, Mhere n is 
some threshold value. Work by Yamada and Parango (1965) suggests 
that a value of n - 4 would be sufficient, but examination of the 



concept of a continuous linear edge suggests n = 5 would 
be more appropriate. Obviously, hovjever, the value of the 
threshold used here would not be independent of that used 
to detect edges in the initial picttsre processing. 

d) The extension of the technique to ai'eas away from the 
regular field systems of Central California. It is hoped 
that it might be possible to use tiiis technique to locate 
ridge and valley lines in areas of the Sierra Nevada. 



7« C onclusions 

Significant pi’ogress has been, mad© in the handling of ERTS CCT 
data to the extent that it is now possible to compile subsets 
of, that data suitable for generalisation^ and to perform some prelimina-ry 
analyses. Results have been obtained from the following procedures, 

1) The compilation of data sets for both polygonal and 
rectangular U.T,A.'s has shorn that the RASA supplied latitude- 
longitude grid is insufficiently accurate to permit direct compilation 
of U.T.A.'s defined by geographical coordinates. An intermediate 
step to correct for this inaccuracy is necessary. This step is 
performed by locating on the CCT’s points whose geographical 
coordinates are known and determining the cori*ection manually, 

2) Generalization of U.T.A. data to gray scale frec[uency 
distributions has been examined. Studies of the transfer functions 
and sensor ine-iuali ties have shown that differences of 1 or 2 gray 
scale values are not sign5.f leant for our purposes, and that the 

gray scales can therefore only be divided into a mazimum of 52 classes. 

5) Analysis of individual histograms have successfully identified 
the proportions of land and sea in a given image. This was successful 
for KSS bands 6 and 7 , but results for bj^ds 4 and 5 were inconclusive 
and reflect the influence of turbid water in the image. 

4) Preliminary tests have shown that satisfactory classification 
of areas into subsets representing surface types can be made using an 



unsuperviaed clustering algoritlim in which the gray scal.e histograms 
make up the variable set« Detailed tests of the relevance of the 
classification have yet to be made, 

5) An edge detection routine has been developed which has been 
used to locate field boundaries and lake edges in the California 
test site. 

The work to date has solved the initial problems relating to 
selecting areas and generalising data for the UTA’s. We are now 
able to evaluate the effectiveness of various types of generalisation, 
with the knowledge that differences between the generalised data 
represent real effects rather than errors introduced by uncertainties 
in the position or in the gray scale distribution. 

In the next reporting period it is proposed to concentrate 
on the clustering approach, using various size OTTJ*s and different 
levels of generalisation. Preliminary tests of other methods of 
generalisation (for example, textural measurments) vdll be made 
in this period. 
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FIGURE’ CAFTIOKS 

Figure 1, Position errors for the dam wall of Honey Lake. 

The open circles show the position of the dam (marked "by the hexagon) 
as found from 'tiie latitude - longitude ^id included with the CCT 
annotation data. 









Figure 2. This contour mup of gray scale values for the sea 
area near the V7ash, U.K, gi.ves a good indication of the distribution 
of sediment laden water. 
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Figure 3. Unsmoothed gray scale histogram for HSS band 7 taken . 
from a UTi in the Central Valleys, California,. Note how much smoother 
this distributioa is than live corresponding distribution for band 
,5. shown in Fig»7. 




Figiire 4. The mapping of the 64 sensor ooxjnt values onto the 
128 tape count values using the fxmction described in section 3. 
The vertical axis gives the number of sensor counts mapped into 
each tape count » 







Figtcr© 5. G-ray scale histograms for successive groups of tiifelve 
scan lines for an area in the Sierra Nevada mountains. 




Tape Count 


Each class corresponds 
to 2 gray levels.' 



Plgxire 6. The convolution function used to remove the effects of 
uneven tape count probability distribution. The v/idth of 'the 
iimction at half height is given by the equation for A T in section 3. 





Figxtre 7. Smoothed and xmsmoothed gray scale histograms for a 
UTA ill the Central Valley. 





Figure 8. The power ^ectra for a strip one pisel id.de and 
parallel to the edge of the frame for the aea area included in the 
U.K. CCT image » The speotea are plotted as a function of 
vfavelength in scan lines ^ and the povrer is plotted as dB relative 
to' the noise power in 'band 4» 





Figure 9. The power spectra for a strip tlrrough the Sierra 
Nevada range. Axes are as fca? Figure 6, 




Figure 10. Sample output ftom the histogram splitting program 
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Figrrre 11 <, Dendrogram showing results of clustering tiie escperimmtal 
set of OTU'g. The error level is iJie square of the distance he ti^een 
the last tr/o dusters amalgamated. 




Figucre 12. Kaps of the groups produced in the last four 
steps of the clustering procedwe. 
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Figure 15. Graph ahotfing the square of the distance hetu’eaQ 
clusters as a function of the number of dusters. 




Figure 14, A computer print out of an area in the Central Valley 
The field pattern is clearly visible in this band 7 image and 
an irrigation canal crosses the south-vrest comer of the area» 

















Figure 15o Sharp botoxdaries detected 3 ji the area sbovm ia P.Lgol4- 
tising the eight nearest neighbours metljod* A double threshold is 
used so that stars represent a larger difference in gray scale value. 





Figure 16, Sharp houndaries detected usiag the three nearest 
neighbours in the north west q.uadrant, Note that the boundaries 
are more sharply defined tiian those in. Figure 15 » and that no 
significant featvtcea sho\m in Figure 15 are not also shotni here. 
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APPENDIX I 


Th6 Oenera l ist^tio'A of ERTS Data fox* Global S e als In.ve3tIft'ation s 

I. E. Hill, A. C. Armstrong & K» M, Clayton 

(Presented at the Symposium on European Barth Resources 
Satellite Experiments, Pehruary 1974) 


1 H'JTROdhgtion 

This paper describes a computer based system we, at the University 
of East Anglia, are developing for the analysis of imagery from the ERTS 
project. The stai*ting point for our work is a belief that, although 
conventional air photo interpretation techniques can be, and have been, 
\ised to interpret reanotely sensed data from orbital platforms, the 
differences between the data obtained from aircraft and from satellites 
warrant the development, or at least the testing, of new methods of 
analysis. These new methods should be designed to work in pax’allel tath, 
rather tiian in place of, conventional analysis, vfith the aim of 
exploiting the additional information contained in the satellite imagery. 

The advantages of satellite platforms over aircraft platforms are 
their ability- to provide (i) repetitive coverage at regularly spaced 
points in time and (ii) global or regional coverage with a uniform data 
base, whereas a major disadvantage of satellite platforms is the limited 
spatial resolution obtainable from such altitudes. Repetitive coverage 
provides the opportunity for monitoring changes in sui*face reflectance, 
due to either transient phenomena such as flooding and bxirns, periodic 
effects due to seasonal changes in, for example, vegetation, or long 
term changes, such as urban development, forest clearance, etc. Ths 
poor resolution and the global coverage together imply that satellite 
imagery v?ould be more usefully used looking for clmnges on a large spatia 
scale than cl)anges involving areas near the limits of resolution of the 


sensor 



The work described here has accordingly been aimed at developing 
a system for moiaitoring satellite imagery for changes that are 
significant on a regional scale* Probably one of the most important 
uses of such a system would be as a first stage data filter for 
an operational earth observation satellite. There it would be used 
to select, for farther processing, only those images which are 
significantly different from earlier imagery of that area. 

2 TECOTIQUES, 

Because it is proposed that the system be used as a first stage 
data filter, it is important that it can make use of data that has 
undergone the miniiaum amount of pre-processing. It is probable that 
any satellite platform for monitoring the earth's stirface on an 
operational and repetitive basis will return the data to earth as 
a digitized picture, and, for this reason, a computer based system 
has been chosen. Other reasons for the use of digitized data are 
the ability to make use both of the spectral information provided by 
multispectral scanners without producing colour composite photographs, 
a slow process when compared ifith the data acquisition rate, and of the 
full dynamic range of ihe sensors, without the degradation in 
radiometric fidelity introduced by photographic processing. 

For this project large general purpose computers (an I® 570/I65 and 
and ICL 1903E) Ixave been used, and the profjramo Iiavs been written 
in Fortran IV, V?h.ile a large computer and a hi^ level programming 
language are convenient in the exparimental stages of a project such as 
tiiis, it seems liicely that a smallex* special piirpose machine, perhaps with 
some special handwar© functions, or with some analogue processing, could 
be more efficient for an operational syslem. 



The first stage in the analysis of the imagery is generalisation of 

the data to a scale approgriate for a regional surv^. Each SETS frame 
7 

contains 5 x 10 pictiire points for the four MSS Bands, vmex'eas a regional 
map at a scale of, say, 1;10 • could show 10*^ - 10"^ independent data points 
in an equivalent area. The second stage is the recognition step where 
the data is compared with that for previous passes over the area, and 
significant chainges are identified. Kote that these two steps in most 
other computer baaed systems for analysing remotely sensed data, are 
performed in the reverse order, 3?or e 2 a.raple, classification into sva'face 
types is usually done using the spectral sigaaturo for each pictiire point, 
and generalization to large areas (individual fields) is only made 
when the classification results are displayed. It is hoped that by 
suitable choice of the area over which this initial generalisation is 
made, it will be possible to mask the effects of large changes taking 
place in a small area, and so obtain a system which is sensitive only 
to changes which are significant on a larger scale, 5br experimental, 
purposes, Unit Test Areas (UTA's) wlxich are square, vrLth 50 km sides, 
have been chosen. 

The simplest possible form of generalisation is to redxxce the data 
for each UTA to a single munber for each spectral band, this number could 
be the mean radiance, in which case the generalisation is equivalent to 
reducing the resolution of the sensor to match the UTA, or it could be 
some more complex parameter. For most purposes, this would bo to drastic 
a reduction in the information content of the data, althouf^i this level of 
generalization could possibly be used to recognise areas with a hij^ degree 
of cloud cover. 



CoaparisoK of the histograms can be done visually or by means of 
statistical tests. In the latter case tlis Chi -squared and iCoimogprov 
SaimoT tests have been used, 

3 , ERELIMINiiHI KiiiSUhTa 

In the previous 'sections the aims and lin.es of investigation of 
the project have been outlined. Regrettably it is still too early in 
the . experiment to say how effective this technique will be, and in 
the remainder of this paper some early results and problems id.ll be 
discussed. 

The area- wliich has been chosen as a test site is covered by one 
ERTS frame, and is shovm in Figure 1, The area covers part of •iixe Central 
Valley in California and extends as far as 1he coastal ranges in the 
south-i-rest, A large area of the Sierra Nevada range is included on the 
eastern side of the frame. Digital data on computer compatible i;apes 
has been obtained for two successi'v© passes over the area in later summer 
1972 and a 'thdrdset of data has been obtained for spring 1973, In 
addition a second test area in central England v;as selected, but repetitive 
coverage is not yet available for this area. 

One essential requirement of tlie technique is that "(iie gray scale 
distribution in unchanged areas is sufficiently repeatable for changes 
in other areas to be recognised. Figure 2 shora the histograms for all 
four bands for a UTA containing predominantly agricultural land in the 
Central Valley, The only changes vi-sible on photographic prints of the 
imagery relate to a few fields which together make up 1 per cent or less 
of the UTA. The histograms ha%'e a similar shape for both scenes 1038~ 
I8II4 and 1056~18114j but there is a small shift to lower grey scale 

i 

values for all bands for the later p^ias. 


This shift can be accounted 



for by the decrease in solar irradience associated the decrease in 

solar elevation^ Table 1 slicws that the ratios of mean radiances is 
comparable with the ratio of the sine of the elevation angle* 

Althou^ there is very good agreement between the large ecale 
structure of the histograms* particular for band 7* tl>ere are many 
smaller features which are not duplicated* The histograms for bands 
4* 5 and 6 tend to be "splkey", and even in the band 7 histogram 
there is a small spike near the peak of the distribution* These 
spikes are not real structure in the gray scale frequency distribution, 
but originate from the mapping of the sensor gray scale with its 
64 values to the CCT gray scale which has 128 levels' for bands 4, 5 
and 6* and 64 levels for band 7* Because of the non“linear transfer 
function and expanded CCT gray scale for bands 4, 5 and 6, and 
because of the rein. ome trie coi-r actions applied to all bands during the 
transfer, there is not a one-to-one mapping of sensor gray levels 
to CCT gray levels. This is illustrated by Figure 5 which gives the 
gray scale frequency distributions for successive groups of six scan 
lines do\m a strip one quarter of the image in width. The features to 
be noted are the blank columns correspondi.ng to 2 gray scale levels for thi 
band 5 data x^hich continue tbrou^ the distribution for several successive 
lines* The calibration pz’ocoss changes the mapping fXom sensor levels 
to tape levels as the sensitivity of the sensor changes* and over a 
sufficiently large nisnber of lines this tends to smooth the frequency 
distribution . Alteimately* the histograms may be smoothed, by use of 
a suitable weighting function* to give acoxit 32 independent gray levels. 
However the smoothing iiinction for' bands 4* 5 and 6 is dependent on the 
actual gray level and results in a non-linoar relationship bett^een tap>0 
gray scale and radiance. The effect of the non-linearity is to freight 
changes in low reflectance surface cover more heavily than changes in 



the upper half of the radiance scale. 

One part of the coaputer systea wiiich has been exterisi'wely tested 
is the segtnent >diich aligns th© digital data relative to the TJTA» The 
confidence tl'iat can be placed in changes in. surface reflectance detected 
by this system depends on the accuracy of this alignment . For example 
witki 50 km square UTA's a r.m..s. positional error of 1 km jilaces 
a lavrer limit on the fraction of the axea for whidi clianges can be 
reliably detected, of about 5?^ of the total area of the td'A. The 
proposed oomputer system uses the latitude and longitude grid information 
in the annotation record of the CCT to convert the coordinates of the 
comers of the UTA to taps coordinates (scan line, picture element). 

The pre-launch estimate of the accmacy of this conversion was about 
500 metres r.m.s. for the three scenes for which tests have been made, 
the errors are in the range 5~10 km. Blg»4 shows the positions found 
for the dam of Honey Lake, California, fcom the annotation data for 
scones 1038-18114 and 1056-18114. This error is sufficiently large 
to obscure any changes involving less than about one quarter of the 
area of the HTA, and so, for most purposes, is large enough to prevent 
this monitoring techniotu© working. 

The XJTA could of course be more acc?arate.Iy positioned relative to 
the digital data by use of several control points on the groxmd, but this 
has the disadvantage that, for an operational system working on a world 
wide scale, the number of control points would soon become •un.nianagenbJ.e . 
The ideal solution for an operational satellite platfocrm would be imjjrovcd 
attitude and position information and conte'ol, but this might be 
teclmically not feasible. One gj-ternative method, which is being tested. 



is to align one imago -siith tbs gromd by marmal msthods, and then to use 
this primary image as a correlation laask for successive passes* 

As a final example of the sort of difficulties that need to be overcome 
befoore the computer system can be used operationally. Figure 5 shovfs the 
effects of clouds and cloud shadow on the gray scale distribution. The 
histograsus are for the same part of the Sierra Ho vada range, but in one 
case the area is partially cloud covered* This produces a sharp peak at 

radiatioe, as expected, but in additicai the cloud shadow, and structure 
witliin the cloud, are sufficient to make the remainder of lie histograJas 
totally different* The implication of this is that changes in the cloud 
free area coiild only be detected if some spatial information is used to 
remove the whole cloud affected area from boili images before preparing* 
the histograms. 

4 CONCLUSIONS 

The results presented above have tended to emplmsize the difficulties 
tliat have been met in developing the computerized system, and have given 
a rather negative outlook on the usefulness of the technique* At this 
stage hovrever, there is still much development required', and no tests of 
the effectiveness of the system will be possible before difficulties such 
as those described above Iia*?e been solved* Other work proceeding in 
parallel with that described here, does hovrever, give some indication that 
xxsing the gray scale histograms as a method of generalising the data is a 
satisfactory solution* In partici'dar , it seems possible to use the 
histograms alone as a means of dividing the into surface types 

using standard procedures* 







